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Abstract 

We analyze the two-player game of Knock 'em Down, asymptotically as the number of tokens 
to be knocked down becomes large. Optimal play requires mixed strategies with deviations of 
order y/n from the naive law-of-large numbers allocation. Upon rescaling by \fn and sending 
ti — ► oo , we show that optimal play's random deviations always have bounded support and 
have marginal distributions that are absolutely continuous with respect to Lebesgue measure. 
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1 Introduction 



1.1 Knock 'em Down 

In the game of Knock 'em Down, a player is given n tokens which (s)he arranges into k piles, or 
bins. After that, a fe-sided die is thrown; if the outcome is side i (which occurs with probability 
Pi > 0), then a token is knocked down from the i th pile. In the event that there are no tokens in 
the i th bin, then no tokens get knocked down. The die is thrown repeatedly un til all th e token s 



have been knocked down. [In the original version of this game as described in }Hun98| BF99J], 
n = 12 and two fair six-sided dice are thrown, with the bin chosen being given by (1 less, let us 
say, than) the sum of the two numbers showing. In that case, k = 11 and pi = (6— \i— 6|)/36. The 
reader may wish to note that we have altered the spelling from "Knock 'm Down" to "Knock 'em 
Down".] 

We consider two versions of Knock 'em Down. In solitaire Knock 'em Down, which we analyze 
in a separate article, there is one player, and his goal is to minimize the expected number 
of iterations until all the tokens have been knocked down. Solitaire Knock 'em Down is also 
equivalent to a two-player zero-sum game, where the payoff to the winner is the expected number 
of extra die throws that the other player requires to knock down all his tokens, and the goal is 
to maximize the expected payoff. In competitive Knock 'em Down, which we analyze in this 
article, it is enough merely to win, and the amount by which a player wins is irrelevant. There 
are two players, who each arrange their tokens into bins without seeing what the other player is 
doing, and then the same die is used to knock down tokens from each of the players' bins. The 
winner is the player whose tokens get all knocked down first (the outcome may be a tie, which 
we resolve with a coin flip). With competitive Knock 'em Down, as we shall see, there is an 
interesting Nash equilibrium. 

Co mpetit ive Knock 'em Down is quite easy to analyze when k = 2. The result is Theorem 1 
m |BF0d | : the authors show that the best strategy is to use the allocation (m, n — m), where m is 



a median of the Binomial(n,pi) distribution. It is instructive to consider competitive Knock 'em 
Down in the next simplest case, in which a fair three-sided die is used. The first player may guess 
that the best strategy is to place n/3 tokens into each of the three bins (assume for convenience 
that n is divisible by 3). But if the first player uses that strategy, then the second player can 
"undercut" by placing (n/3) — 1 tokens into each of the first two bins and (n/3) + 2 tokens into 
the third bin. Then, for large n, the probability that the second player's third bin empties out 
last is only slightly larger than 1/3, while the probability that the first player's first or second 
bin finishes last is only slightly smaller than 2/3, so that the second player wins about two-thirds 
of the time. It turns out that an optimal strategy is to allocate approximately n/3 tokens to 
each bin, but with certain random perturbations to the bin allocations. (See Figure [TJ) In 
game-theoretic terminology, optimal play employs a mixed (non-pure) strategy. 

To simplify the analyses of both games for general k and p := (p±, . . . ,Pk), we suppose that the 
games are run in continuous time, with the die being thrown at instants governed by a Poisson 
point process with rate 1. Throwing the die at random times rather than at deterministic times 
has absolutely no impact on the outcome of a game of competitive Knock 'em Down; likewise, 
in the case of solitaire Knock 'em Down the expected (total) clearance time (i.e., time to knock 
down all tokens) remains unchanged. Suppose in either game that a player places £j tokens 
into bin i, and let Tj denote the time that it takes for bin i to be cleared. Since the game is 
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Figure 1: An optimal strategy for (an approximation to) the continuous limiting version of 
two-player Knock 'em Down described in Section [1,31 when k = 3 and p = (1/3, 1/3, 1/3). The 
strategy chooses a hexagon with probability proportional to its darkness, and then allocates the 
chips according to the hexagon's coordinates. Hexagons near the lower-left, lower-right, or top 
of the triangle allocate more chips to the first, second, or third piles respectively. The top corner 
has coordinates (-0.28, -0.28, +0.56)/v / 3. 
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run in continuous time, the Tj's are mutually independent, and the variable p{Fi is the sum 
of £j independent exponential random variables with unit mean, i.e., p{Fi is a Gamma random 
variable with shape parameter £j. The clearance time is T = maxj Tj. 



1.2 Asymptotic scale: root-n deviations 

Partial resul ts concerning optimal p lay ha ve been derived by Art Benjamin, Matt Fluet, and 
Mark Huber |BF99l ] |BFOO( | [BFHOiI ] |F!u99I | for both versions of Knock 'em Down, but in neither 



case has optimal play been characterized analytically. For both versions, we focus here on the 
asymptotics of optimal play when the die remains fixed [i.e., the vector p is held constant] and the 
number of tokens n becomes large. While optimal play in competitive Knock 'em Down requires 
a mixed strategy, it is useful to consider first what happens when a player deterministically 
places & tokens into the i th bin (1 < i < k). If ^ is large, then by the central limit theorem, 
Tj is approximately normal with mean (,i/pi and variance Ci/pf- I n particular, bin i is emptied 
by time Tj = (d/pi) + O p (\/ r ^i/pi). (We have used here the usual notation X n = O p {y n ) to 
mean that inf ra Pr[|X n | < cy n ] tends to 1 as c — > oo; equivalently, one says that the family of 
distributions C(X n /y n ) is tight.) 

If £j = nqi, where the q^s are nonnegative and sum to unity, then the clearance time T satisfies 
T = nm&Xi(qi/pi) + O p {^/n). Since maxj(c/j/pj) > 1, this suggests t hat the o ptimal allocation 



is £i ~ npi, and indeed for the solitaire game this was first shown in [BFHOll ]. We show in our 
companion paper on solitaire Knock 'em Down that the optimal choice is £j = npi + 0(y / n) for a 
judicious choice of the 0(\/n) perturbations. For competitive Knock 'em Down, our Theorem ll.2l 
below shows similarly that an optimal mixed strategy will never choose £j differing from npi by 
more than order y/n. 

Define the overplay of an allocation £ relative to np to be 

(Hi 

max n 

i \Pi 

In the following lemma, {£ beats if} is the event that the corresponding clearance times satisfy 
the strict inequality T? < Tfj, or else = Tfj and £ wins the coin toss. The notation extends 
naturally to {a beats $} for mixed strategies a, /?; see the start of Section[2]for careful analogous 
definitions in the continuous-game setting that we introduce in Section 11.31 

Lemma 1.1. If the overplay of allocation fj exceeds the overplay of allocation £ by at least 
wy/kn I miuiPi, then Pr[ff beats |j < 1/w 2 . 

Proof. Let j be the bin maximizing rjj/pj. In order for allocation fj to win, there must be some 
bin t 7^ j for which ff s bin j clears out before £'s bin £. The clearance time To of ffs bin j has 
mean T]j/pj and variance rjj/pj < n/pj, while the clearance time Ti of £'s bin £ ^ j has mean 

€e/pe — (Vj/Pj) ~ (wy/kn/ voixiiPij and variance < Ci/Pe- Since T and are independent, by 
Chebyshev's inequality 

(n/pj) + (Ce/pj) ^ 1 + (Ce/n) 



Pr[0 < T t - T ] < J- . \ 2 < , 
[w\/kn/ mmiPi) w ft 
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and since the support of T(_ — Tq is M, the first above inequality is strict. Upon summing over 
£ / j, we find that the probability that there is some bin £ ^ j for which To < Tg is less than 
1/w 2 . □ 

Theorem 1.2. No optimal competitive Knock 'em Down (mixed) strategy ever overplays by 
more than ( 3 y 3 + \fh\fhn + 1 ) /minjpj. 



Proof. Let a be any optimal strategy, and let a{x) be the probability that strategy a picks an 
allocation with an overplay of x or more. We show successively, in each case using Lemma 
that 

fwVkn+l\ [2wVkn + l\ ( 3w\fkn + l\ 

a \ : ] <Pi, a \ : < p 2 , a : =0, 

y muii pi J y mimpj J y raiiiiPi J 

for w = y/3 + y/E, Pi = = k 2 ^, and p 2 = ^fi. Let p = 1 - (1/w 2 ) = (1 + y/E)/4, 

Suppose first that a((w^fkn + 1)/ minj^j) > pi. Consider pure (i.e., deterministic) strategy (3 
which always plays np (rounded to an integer vector summing to n). Because of the rounding, 
(5 may overplay np slightly, but never more than 1/ minj^j. By Lemma ll,14 strategy (3 beats a 
with probability > pi X p = 1/2, contradicting the optimality of a. 

Suppose next that a((2wykn+l) j rn.va.iPi) > P2- With positive probability (> 1—pi), strategy a 
overplays by no more than (wVkn + 1)/ mhijpj, so we can define a strategy [3 which plays 
according to strategy a conditioned to overplay by no more than (wykn-\- 1)/ minj^j. When j3 
plays against a, by Lemma [TTTl (3 wins with probability > (1 — p\) x \+p 2 x p = 1/2, which is 
again a contradiction to the optimality of a. 

Suppose finally that a((3wV~kn + l)/minjj)j) = 5 > 0. Let be the strategy which plays 
according to a conditioned to overplay by no more than {3w^/hn + 1)/ minjj>j. When (3 plays 
against a, by Lemma [L~H strategy [3 wins with probability > [(1 — 5) x i] + [5 x (1 — p 2 ) Xp] = ^, 
which is again a contradiction. □ 

1.3 A continuous game 

From Theorem 11.21 we see that any optimal strategy for competitive Knock 'em Down plays 
allocations deviating by O p (y / n) from the naive law-of-large- numbers allocation np. For given n 
and allocation £, it is thus natural to define numbers Xi so that £i = pin + Xi^/n. Then 

Ti ~Gamma(^, l)/p; 

~ Normal ( n H — -\^n, 1 — ^ \fn 

\ Pi Pi Pi 

~ n + Normal f — , — ) \pn. 

\Pi Pi) 

(By ~ we mean that the total variation distance between the two distributions tends to as 
n — > oo with pi fixed and Xi bounded.) The player chooses the x^'s so that 



it 
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and pin + Xi^/ri is an integer, and the player's tokens are all exhausted at time approximately 



n + v n max h 



Pi VP* 

where the Z^s are independent standard normal random variables. 

Thus the large-n asymptotics of either version of Knock 'em Down is effectively a continuous 
game (called solitaire/competitive continuous Knock 'em Down), where the first player chooses 
real numbers x% satisfying Y2i X{ = 0, and his clearance time is 

< i x i , Zi 
Ls = max 1 



Pi ^Pi 

The second player similarly chooses numbers summing to 0, with clearance time Tg defined us- 
ing the same Zi's. The sequel provides various rigorous connections between n-token Knock 'em 
Down and our continuous game. 

We present here two indications that even qualitative analysis of the continuous game is not 
entirely trivial. First, the naive pure strategy x = has no optimal response. Indeed, the 
responding player can undercut by playing (— e, — e, . . . , +{k — l)e), and letting e I provides 
strategies which give the respondent asymptotically the optimal probability {k— l)/k of winning, 
but no strategy achieves this probability. Second, it is not immediately clear that our two-player 
continuous game has a value (in the game-theoretic sense). The re are s tandard tools for proving 



that a continuous game has a value, such as results of Ky Fan [Fan53l | . but our payoff function 



is rather severely discontinuous at certain points an d the t ools require a semicontinuous payoff 



function. Continuous games without values do exist [SW57I ] . but it turns out that our game does 
indeed have a value. One can prove this by a suitable comparison of our game with a "ties go 
to player 1" modification of the game having upper semicontinuous payoff function, but we will 
give a somewhat more direct proof whose basic idea is simply to pass to the limit from n-token 
optimal strategies. 



1.4 Guide to later sections 

With solitaire continuous Knock 'em Down, the optimal strategy is deterministic, and we are 



able to characterize it for general p^s FW 06] . With competitive continuous Knock 'em Down, 
optimal play is random (i.e., mixed) with a rather complicated distribution (see Figure [1]). Even 
in the simplest nontrivial instance, where k = 3 and p = (1/3,1/3,1/3), we are unable to 
calculate an optimal strategy. However, for general k > 3 and p, we are able to derive some 
basic results about optimal play. In Section [2] we show, for example, that any optimal strategy 
for the continuous game has absolutely continuous marginals. In Section [3] we prove that the 
continuous game has an optimal strategy. In Section 2] we show that a good strategy for the 
continuous game can be converted to good strategies for the n-token games by rounding; in 
particular, an optimal continuous strategy can be converted to asymptotically optimal n-token 
strategies. Finally, in Section [5] we list some open problems arising from our work. 
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2 Properties of optimal play 



The main result of this section (Theorem 12. 5p asserts that the marginals of any optimal strategy 
for the continuous game described in Section \1. 31 are absolutely continuous. This will follow from 
an estimate which holds (see Lemma l2.4p equally well for the discrete game. 

We begin by establishing some terminology for the continuous game; analogous terminology will 
be employed for the discrete game. Recall that 



is the clearance time for a player using allocation x = (x%, . . . , Xk) with x\ H + Xf~ = 0. Here 

Z\, . . . , Zk are independent standard normal random variables. Recall that if player 1 (say) uses 
allocation x and player 2 uses allocation y, then player 1 wins if and only if Tg < Tg] for short, 
we simply say a x beats y ", in which case player 2 pays 1 unit (utile) to player 1. 

The usual game-theoretic analysis takes the payoff K(x, y) to be the expected amount won by 
player 1, namely Pr[x beats y\ — Pr[y beats x]; for convenience we instead define K(x,y) to be 
half this difference. 

For any two mixed strategies a and (3, Pr[a beats 0\ = JJPr[x beats y\a{dx) (3{dy) and we 
extend the definition of K to 



We say that a dominates (3 if K(a, (3) > 0. A strategy a is optimal if no mixed strategy 
(equivalently, no pure strategy) dominates it. 

Notice that payoffs are unaffected if we flip a fair coin to decide ties. That is, in the above 
definitions we may (and do) without effect redefine Pr[x beats y\ as Pr[T^ < T$]. Here we have 
introduced the convenient notation PrL4 < B] to mean Pr[A < B] + ^Pr[A = B]; similarly, 
PrL4 < B < C] is shorthand for PrL4 < B < C] + ±Pr[A = B < C] + ±PrL4 < B = 
C] + 4 PrL4 = B = C]. (In our use of "Pr[^4 < B < C]" below, the third term always vanishes 
because Pr[A < C] = 1.) 

It will be convenient to view the contest between allocations x and y as follows. Let := 
max(xj,yi) for 1 < i < k. For convenience we will refer to m = (mi, . . . ,mfc) as an allocation, 
even though we are now working on the y/n-scale of deviations and the sum mi + • • • + may 
exceed 0. Correspondingly, on the scale of tokens, define Mj := p^n + rriiy/n. Let I = I^be the 
bin that is cleared last when the bin sizes are M%, . . . , [In the continuous game, I is defined 
correspondingly as argmaxj(— + -fM, where the Z^s are independent standard normals. 1 If 

Pi \/Vi 

x i > Uli then y wins; if x\ < yj, then x wins; and if xi = yj, then the game is resolved by a coin 
flip. (Thus, overall, Pr[x wins] = Pr[x/ ^ Hi]-) To compare various strategies, we will couple 
the i^'s for various values of rh, taking the viewpoint that the same random sequence of die 
tosses will be made for a given game regardless of the allocations x and y that the two players 
use (in the continuous game, that the same vector of Z^s will be used). Then increasing mi 
while leaving the other m,-'s (j ^ i) fixed may change I from a value different from i to i, but 
otherwise I will not change. 





(2.1) 
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Lemma 2.1. If fh is incremented by l/y/n in position i, then 



Pr[I changes] < (1 + 0(1))/^ = 0(l/^/n). 

Proof. Let M := Mj, so that M is incremented by 1 to produce from fh a new allocation fh' . 
Ignore bin i for the moment, and let T := maxj^Tj be the time at which all remaining bins 
are cleared for allocation fh (or fh'). In order for 1^ / L^i, it must be that bin i was selected 
exactly M times up through time T. Conditional on T, the number of times bin i was selected 
is Poisson distributed with parameter A := Tpi, and the probability of exactly M selections is 

e -A A M e ~M M M 1 
< : < 



M\ ~ Ml ~ V 2=JI~ 



Thus (unconditionally) Pr[I changes] < l/y/2irM. We use this bound when n is large and (say) 
M > pin — n 2 / 3 . When M is smaller, we instead use Pr[J changes] < Pr[J^/ = i], which is 
o(l/y / n) (indeed, is exponentially small in n). □ 

Lemma 2.2. For fixed p, the distribution of 1^ is "locally flat" as a function of fh: for bins h, 
i, and j (not necessarily distinct), with eh and denoting unit vectors, 

Prfe = j] + Pr[/^ + ^ + ^ = j] = = j] + Pr[I A+ ^ = j] + 0(l/n). 

Proof. Let F t (M) denote the probability that the game with allocations given by M has clearance 
time < t. If Ym denotes the number of times that bin t has been selected through time t, then 
the Yi's are independent Poisson processes with respective intensity parameters pg. We have 

k 

F t (M) = Pi[A k e=1 Ye it > M e ] = JJPr[Poisson(p^) > M e ]. 

l=\ 

The probability that the clearance time falls in (t, t + dt) with bin j the last bin cleared is 

[F t (M - ej) - F t {M)\ pj dt = AjF t (M - ej ) Pj dt, 
where ej is the j th unit vector and Aj denotes the difference operator 

A j f(x)=f(x)-f(x + e j ). 
Letting M' = M — ej we therefore have 

Pr[I a =j]= Pj / AjFt(M') dt, 
Jo 

where here we are viewing / = 1^ as a function of M = pn + fh*Jn. In this notation, the desired 
bound we will establish is 

A h Ai Prf/^ = j] = Pj / AhAiAjFt(M') dt = 0{l/n). (2.2) 
Jo 
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Since we keep pj fixed as n — * oo, we see from (|2.2[) that we may treat h, i, and j symmetrically. 
Thus there are three cases to consider: (i) h, i, and j all distinct, (ii) h = i ^ j, (iii) h = i = j. 
Let us consider first the case (i) of distinct h,i,j: 



poo 

A h A i Pi[I Ai =j]= / Pr[Y h>t = M h ) Pr[Y i;t = M i 
Jo 



Pi[Y j>t = Mj - 1] Pj [ H Pr[Y tjt > M(\ ) dt, 



| A„A, Pr| / 1? = ,1| < fc^-j^g- PrK,, = M, - 1] „ M 



1 



1 



^2nM h y/27rWi' 

If both and Mj are of order n (as when good strategies are used), then we obtain the desired 
bound of lA^AjPr^jg = j]\ = 0(l/n). Otherwise, if (say) Mj = o(n), we may instead use the 
bound 



AiPr^ = j}\< Pr[/^ +ei + l A \ < Pr[J 



M+ei 



which is of much smaller order than 1/n. Thus [A^Aj Pr[i^ = j] \ = 0(l/n) in any case. 
Now suppose (ii) h = i ^ j. In this case 



A t A,Pr[/, 7 =j] 



Pr[F M = Mi] - Pv[Y ht = Mi + 1]) 
x Pr[^-, t = Mj - l] Pj H Pv[Y £ j > Mi] ) dt, 



/•oo 

|AjA 4 Pr[J^ = j}\ < / | Pr[Y M = Mj] - Pr[Y ht = M + 1]| 
Jo 



x Pr[Y i)t = Mj - l]pjdt 

< sup | Pr[Y itt = Mi] - Pr[Y i>t = M + 1] 
t 



Let A ■= Pit and M := Mj + 1. For fixed M, the expression 



A 



M-l 



M! 



-A 



A 



M-l 



(M-l)! 



1 



A 

M 



(M-l)! 

is easily seen to be maximized when A is one of the values A = M =p \/M, and so 

|AjAjPr[/, 1 =j]| < ' ' 



y/2^M~ y/Mi + 1 ' 

As in case (i), we conclude that |AjAjPr[/^ = j]\ = 0(l/n). 

In case (iii) h = i = j, the desired bound follows from the equality 

A*A, Pr[/ Al = j] = AjAjl = 



and the bounds for case (ii) h = i 7^ j. 



□ 
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Define the 5-undercut of x to be the mixed strategy that increments x by (k— 1)5 in a uniformiy 
random coordinate, and decrements x by <5 in the remaining coordinates. (In the discrete setting, 
5 will be a multiple of 1/y/n.) The preceding lemma implies 

Corollary 2.3. Uniformly in allocations x and y such that \xi — y%\ > (k — 1)5 for each coordi- 
nate i, 

| Pr [5-undercut of x beats y\ — Pr[x beats y\\ = 0(5 2 ). 

Proof. We prove this in the discrete setting; the result for the continuous setting follows by 
taking limits. We assume without loss of generality that the bins are numbered so that Xi > yi 
for 1 < i < £ and X{ < yi for i + 1 < i < k. Recall that rh = max(x, y). Let x' denote the 
random (5-undercut of x, and fn! = max(x',y). We have 

k 

Pt[x beats y\ = ^ Pr[i^ = j], 

j=£+l 

and because x and y differ by more than {k — 1)5 in each coordinate, 

k 

Vi[x beats y\ = ^ Pr[Im' = j}- 

j=£+l 

For each coordinate i, E[m^] = m;, and \m' i — uii\ = 0(5), so the corollary follows from the 
local- flatness property we proved in Lemma [2.21 □ 

Lemma 2.4. For any fixed p with k > 3 bins, for any coordinate i, for any 5 > 1/y/n, and for 
any optimal strategy a, if x and y are independent draws from a, then Pr[|a;j — yi\ < 5] = 0(5), 
where the constant implicit in 0(5) depends only upon p. 

The same proof works for both the discrete and continuous versions of the game. 

Proof. We construct a strategy (3 which attempts to beat strategy a by undercutting it, to wit: (3 
picks an allocation x from a, but, rather than playing x, strategy (3 instead plays the (5-undercut 
of x. By analyzing how (3 fares against a we will be able to bound Pr[|xj — yi\ < 5]. 

When (3 and a are pitted against each other, we will take the viewpoint that x and y are 
independently drawn from a and a fair coin is flipped; if the coin lands heads, then a plays x 
and (3 plays the J-undercut of y, while if the coin lands tails, then a plays y and j3 plays the 
(5-undercut of x. Recall that / is the last bin to be cleared when the allocations are x and y. 
Without the undercutting, bin / is owned by (3 with probability 1/2. (We say that a player 
"owns" the last bin to be cleared if he was the player who placed more chips in that bin, or lost 
the coin toss in the event of tie.) Let I 1 be the bin last to be cleared with the undercutting. 

Letting E be the event that \xi — yi\ < (k — 1)5 for some coordinate i and E c its complement, 
we may express 

Pr[a beats 0] = Pr[{a beats (3} n E] + Pr[a beats (3 \ E c \ Pr[£ c ] 
< Pr[{a beats 0} n E] + (| + 0(5 2 )) Pr[E c ] 
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by Corollary 12,31 The optimality of a implies 

< Pr[{a beats (3} P\ E] - \ Pv[E] + 0(5 2 ). (2.3) 

Now, conditioning on x and y, 

Pr[a beats f3\x,y\ = Pr[/3 owns /' ^ 1 1 x, y\ + Pr[/3 owns /' = I \ x, y\ 
< Pr[I 1 1 x, y\ + Pr[/3 owns / 1 x, y]. 

By Lemma 12.14 Pr[/' ^ I \ x,y] = 0(5). To compute Pr[/3 owns / 1 x, y\, we condition on /. For 
example, conditionally given the event 5 < \xj — yj\ < (k — 1)5, the player using (5 owns / with 
probability 1/2 if the "overcut bin" [the bin with allocation incremented by (k — 1)5] chosen is 
not bin I and with probability 1 if it is. Thus, if 5 < \xj — yj\ < (k — 1)5, then 



Pr[/3 owns I \x,y, I] 



1 k) + k 



2 ^ 2k- 



The other entries in the following formula are computed similarly: 



Pr[/3 owns I\x,y, I] 



Thus, conditional on x and y but not /, 
Pr[a beats j3 \ x, y[ — - = 0(5) 



f 1 

2 




if 


- yi\ 


> (k — 1)5 


1 + 


1 

'Ik 


if 1^7 


- yi\ 


= (k — 1)5 


\ + 


1 

2fc 


if 5 < 


\xi - 


- yi\ <(k- 1)5 


\ + 


3 

■Ik 


if 1^7 


- Vl\ 


= 5 


1 

I fe 




if \xi 


- yi\ 


< 5. 



+ ^Pr[<5 < \xj 



< 



(k- l)5\x,y\ 



2k 



Pt[\x! -y/| <5\x,y\, 



and so unconditionally 



Pr[{a beats (3}nE}-~ Pr[E] = 0(5) Px[E] 

+ ^Pr[5< \xi 

Substituting this into (|2.3p . and then rearranging, 



< 



(k - 1)5] 



2k 



■Pr[\xi- yi \<S\. 



Pr[\ Xl - yi \ < 5] < 0(5) Pt[E] + 0(5 2 ) + ^ Pv[5 < \xj 



VI 



< 



(k - 1)5], 



(k - 1) Pr[\ Xl - yi \ < 5] < 0{5) Pv[E] + 0(5 Z ) + Px[\ Xl - yi \ < (k - 1)5] 



Recall from Theorem 11.21 that we have 0(^/n) bounds on the overplay or underplay of the 
optimal strategy a. It follows that there is a positive constant q (depending on p) such that 
Pr[J = i | x, y\ > q for any coordinate i and plays x and y that a might make. Recalling also 
that E is the event that \xi — yi\ < (k — 1)5 for some coordinate i, we see that 

Pr[E] < - Pv[\ Xl - yi\ < (k— 1)5]. 

q 
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Letting r := k — 1, we have then that, for some c and all 5 > 1/y/n, 

Pr[|xj - y/| < 5] < co~ 2 + Hi-PrNxj - yjl < nfl. 

r 

When r = 1 (i.e., k = 2) this inequality is uninformative, but otherwise we may iterate it to 
show, for j > 1 and <5 > 1 /\/^> 

Pr[\xj - yj\ < S] ^ c[§2 + + + 1_ ^ _ ^ < ^ 



(1 + c<5) x • • • x (1 + crj~ l 5) ~ 1 1 ri 

We take j = [log(l/<5)/logr] so that (1 + cS) X ■ ■ ■ X (1 + cr- 7 " 1 ^) = O(l) and both terms on the 
right-hand side are O(^). Thus 

Pr[|xr - < <5] = 0(5). 
Recalling again that Pr[|x/ — yj\ < 5] X Pr[Uj{|xj — < 5}] yields the lemma. □ 

Theorem 2.5. For competitive continuous Knock 'em Down with k > 3 6ms, eac/i marginal 
distribution of any optimal strategy is absolutely continuous with respect to Lebesgue measure, 
and has a square-integrable density function. 

Proof. If X and Y are i.i.d. draws from the marginal distr ibution , then, by Lemma \2A\ Pr[|X — 
Y\ < 5} = 0(8). From this it is straightforward (see, e.g., Mat95l . Theorem 2.12]) to check that 
the distribution is absolutely continuous with respect to Lebesgue measure, and that the density 
function is square-integrable. □ 

Note that the support of any measure with nonzero absolutely continuous part has positive 
Lebesgue measure, and that any subset of the line with positive Lebesgue measure has positive 
1-dimensional Hausdorff measure. Thus we have 

Corollary 2.6. For k > 3 bins, the support of any optimal strategy for continuous Knock 'em 
Down has Hausdorff dimension at least 1. 

It is natural to guess that the true Hausdorff dimension is k — 1, and indeed that optimal 
strategies are absolutely continuous with respect to (k — l)-dimensional Lebesgue measure. 



3 Existence of an optimal continuous strategy 

While every game in which each player has a finite number of options will have a value (which 
of cou rse is for n-token competitive Knock 'em Down), there are continuous games without a 



value SW57I ]. Thus Theorem 13.11 below has nontrivial content. 

Recall the definition of K at (|2.ip . For the n-token game, we regard a mixed strategy a n as a 
probability measure on A;-tuples x = (xi, . . . , Xk) with vanishing sum, as described in Section CH 
and we define the payoff function K n on this x-scale. We say that a sequence (a n ) of strategies 
for the n-token competitive Knock 'em Down games is asymptotically optimal if 

min K n (a n ,y) — > as n — > oo. 

yeA n 

Here the min is taken over the finite (but growing, as n — ► oo) set A n of (normalized) actions 
(allocations) available for n-token Knock 'em Down. 
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Theorem 3.1. The continuous game has value 0, and there is at least one optimal strategy. 
Indeed, any subsequential weak limit of any asymptotically optimal sequence (a n ) of strategies 
for n-token competitive Knock 'em Down is an optimal strategy for the continuous game. 

Key ingredients to the proof of this theorem are Theorem 11.21 and Lemma 12.41 The converse to 
this theorem is proved in Corollary 14.31 



Proof of Theorem \3.1[ By Theorem 11.21 and Lemma 11.11 any sequence (a n ) of asymptotically 
optimal strategies (probability measures) i s tight . Therefore there is a subsequential weak limit 



a which is a probability measure (see e.g. [Str93l . Theorem 3.1.9]). It is easy to check that a is 



concentrated on fc-tuples with vanishing sum, so that a is a mixed strategy for the continuous 
game, and (using Lemma l2.4p that a has atomless marginals. We claim that a is optimal, and 
then we see immediately that the continuous game has value 0. 

The proof that a is optimal is rather routine. To avoid double subscripts, we henceforth in- 
nocuously assume that the full sequence (a n ) converges weakly to a. Fix any pure strategy y 
for the continuous game. Let y n denote a rounding of y to a pure strategy for K n ; the details 
of the rounding procedure are irrelevant for our purposes here, as long as y n — > y. Using the 
facts that a has atomless marginals, a n — > a, K is continuous away from pairs (x, y) for which 
Xi = yi for some i, and (for any e > 0) \K n (x,y n ) — K(x,y)\ — ► uniformly for x satisfying 
\%i — Ui\ > £ for i = 1, . . . , k, it follows (we omit the details) that K n (a n , y n ) — > K(a, y). But by 
the optimality of a n , K n (a n ,y n ) > for every n, so K(a,y) > and a is optimal. □ 

Remark 3.2. (a) If it happens to be true that there exists a unique optimal strategy «o f° r K, 
then Theorem 13. II implies that 

a n — > ao as n — > oo. (3.1) 

(b) If p\ = ■ ■ ■ = pk, it is possible to choose the strategies a n to be symmetric. If so, and if it 
happens that there exists a unique symmetric optimal strategy ao f° r K, then (|3.ip holds. 



4 Asymptotically optimal play of Knock 'em Down 

The main result of this section (see Corollary 14. 3p is that when an optimal strategy ao for 
the continuous game K is "rounded" to produce allocations for n-token Knock 'em Down, the 
result is an asymptotically optimal strategy. The drawback here is that we do not know how to 
construct such an ao, but we might at least hope to find a not unreasonably suboptimal d for K, 
such as the one discussed in Remark 14.21 We are thus motivated to show more generally (see 
Theorem 14. ip that "rounding" of any strategy d with atomless marginals and bounded support 
gives a strategy for Knock 'em Down whose worst-case payoff (i.e., payoff against an opponent 
playing the best possible response) is asymptotically at least as large as the worst-case payoff in 
game K from use of d. 

Theorem 4.1. If a n — > a, where a has atomless marginals and bounded support, then 

liminf min K n (a n ,y) > infi^(d,y). (4-1) 

n->oo y £ A n y 
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Proof. Let us define 

K n := min K n (a n ,y), (4.2) 
yeA n 

and let y n G A n be a strategy achieving this minimum. Since the sequence (a n ) converges 
weakly, it is tight, so by Lemma ll.ll y n remains bounded. Let ni f oo be any sequence for which 
lim^oo n ni = liminfn^oo n n . By compactness, we know that there is a subsequence hi \ oo and 
a continuous-game allocation y such that — ► y. 

The supremum of \K n (x,y) — K(x,y)\ over any bounded set of (x,y) tends to as n — > oo; in 
particular, ties do not cause a problem here. Since the sequence (a n ) is tight and the sequence 
y n is bounded, we have that n n differs by oil) from 

k n := K{a n ,y n ). 

For e > define 

F £ := {x : |xj — yi\ < £ for some i}, 
and let i 7 ^ denote the complement of this set. We have that 

K(x, yn e ) -> i^(x, y) as £ -> oo 
uniformly for x £ F^. Thus, fixing e > 0, 

> / K(x,y)an e (dx) - |a ft< (F e ) - o(l). 

Now, by an argument (omitted here) very much like one used in the detailed proof of Theorem l3.ll 
one finds 

lim k he > K{a,y) - 2a(F £ ). 
Letting e J. and using the continuity of the marginals of a, we obtain 

liminf K n = liminf k n = lim > K(a,y) > inf K(a,y). □ 

n^oo n^oo £~ >oo y 

Remark 4.2. Even very simple-minded mixed strategies can offer substantial improvement 
over pure strategies. To illustrate this, we consider k = 3 and p = (1/3,1/3,1/3). Numerical 
experimentation suggested that the uniform distribution a over the simplex of all triples x = 
(xi,X2,xs) summing to zero and satisfying Xi > —1/6 for all i might be a good strategy for 
the continuous game. Indeed, numerical explorations indicate that the best response to a is the 
pure strategy (0, 0, 0) and thence that 

RHSgU) = -0.0101219 

[a huge improvement on the worst-payoff value — 1/6 = —0.1666667 resulting from use of the 
naive pure strategy (0,0,0) in place of a]. The strategy a was "rounded" to a strategy aiso for 
180-chip Knock 'em Down by taking aiso to be uniform over allocations placing at least 58 chips 
in each bin. [Note 60 — (l/6)\/l80 = 57.8.] Then, according to Mathematica, the value of k±so 
at flM]) is 

Kl80 = -0.0165257, 

improving on the worst possible payoff of = —0.0920653 for the naive allocation (60, 60, 60). 

Corollary 4.3. If a n — > ao, where ao is optimal for the continuous game K, then a n is 
asymptotically optimal for the n-token Knock 'em Down game K n , in the sense that n n defined 
at (I4.2j) vanishes in the limit. 
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5 Open problems 



We have proved the existence of an optimal strategy for two-player continuous Knock 'em Down, 
but we do not have an explicit description of optimal play even when k = 3 and pi = P2 = P3 = 
1/3. We know that the marginal distributions of optimal play are absolutely continuous with 
respect to Lebesgue measure. Consequently the set of pure strategies supporting optimal play 
will have dimension at least 1; perhaps the dimension is k — 1. 
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